Method and system for transferring data to improve responsiveness when sending large data sets

ABSTRACT

Most of the internet traffic today is carried out via the Transmission Control Protocol (TCP). The main advantage of TCP is that it provides reliable data transfer to the application layer and simplifies programming. The protocol maximizes data throughput but may also lead to noticeable transmission delay in wide area networks (WAN). A client-server based medical image viewing system is disclosed that achieves high data throughput over TCP without impacting responsiveness. Special timestamp messages inserted into the data stream allow the system to detect situations where network latency increases noticeably and to obtain a reliable estimate of sustained transfer bandwidth. The system applies a feedback scheme that avoids network delays by limiting send bandwidth. In addition other parameters, in particular image compression settings, are dynamically adjusted depending on current network quality.

PRIORITY CLAIM

This application is a continuation of (1) U.S. application Ser. No. 15/361,038 entitled ‘METHOD AND SYSTEM FOR TRANSFERRING DATA TO IMPROVE RESPONSIVENESS WHEN SENDING LARGE DATA SETS’, inventors D Stalling et al., filed Nov. 24, 2016 which is a continuation of and claims priority to (2) U.S. application Ser. No. 13/831,982 entitled ‘METHOD AND SYSTEM FOR TRANSFERRING DATA TO IMPROVE RESPONSIVENESS WHEN SENDING LARGE DATA SETS’, inventors D Stalling et al., filed Mar. 15, 2013 which issued Nov. 29, 2016 as U.S. Pat. No. 9,509,802, the teachings of (1) and (2) are explicitly incorporated herein by reference in their entireties.

FIELD OF INVENTION

The invention pertains to digital data processing and, more particularly, by way of example, to the transferring of data between a client and a server and has application to areas including medical imaging, atmospheric studies, astrophysics, microscopy, spectroscopy, satellite imaging and geophysics.

BACKGROUND

Many computer applications today demand high network bandwidth over the internet. Good examples are systems that download large amount of data such as files, music or videos. Most of the internet traffic today is carried out via the Transmission Control Protocol (TCP). The main advantage of TCP is that it provides reliable data transfer to the application layer. The application does not have to deal with lost data packets, corrupted data packets, or out-of-order arrival of packets. All types of error detection and retransmission algorithms are already implemented in the TCP protocol. Also, sophisticated methods for congestion avoidance and flow control have been added to the TCP protocol. Most of these methods are intended to optimize bandwidth, i.e., data throughput, over a network.

Maximized data throughput usually comes at the price of increased latency. For example, a common technique is to not send out small pieces of data immediately but to wait until more data is available, so that larger packets can be sent then (e.g. Nagle algorithm). This increases bandwidth but also introduces extra delay. Another approach is to send out large amounts of data before getting an acknowledgement by the receiver. This also increases bandwidth but at the same time may increase the time a data packet is in transfer.

For many applications maximum bandwidth is by far the most important criterion. Increased latency is often not a problem. This is not true for applications like voice over Internet Protocol (IP) or teleconferencing. Here low response times, i.e. low latency, are crucial. These applications usually disable the Nagle algorithm or do not use TCP at all. Often bandwidth requirements are not that high for such applications.

Another class of applications requires both high bandwidth and low latency. This is true for example for a client-server based medical image viewer. Such a system needs to display large amounts of image data which are streamed from the server to the client. Often it is advisable to send images before they are requested by the client such as in traditional streaming applications. For example, if a doctor looks at the first image of a 3D image series then it is likely that she will also look at the second image soon. But if the doctor proceeds, some images that are scheduled for later streaming suddenly have to be transferred immediately, or images have to be rendered on the server and then displayed on the client as soon as possible. Thus it is important that the server stays always responsive and that new data can be sent as quickly as possible to the client based on current user interaction.

A general aspect of network based applications is that often not all parameters of the network are known, or can be influenced by the application. For example routers or other network devices between the endpoints may introduce latencies and buffers that are not application controlled. Often the network has to be regarded a black box.

SUMMARY OF THE INVENTION

In an embodiment of the present invention, a client-server based medical image viewing system that sends data over a standard TCP connection in such a way that high data throughput is achieved without impacting responsiveness. Special timestamp messages inserted into the data stream allow the system to detect situations where network latency increases noticeably and to obtain a reliable estimate of sustained transfer bandwidth. In an embodiment of the present invention, the system applies a feedback scheme that avoids network delays by limiting send bandwidth. In various embodiments of the present invention, other parameters, in particular image compression settings, can be dynamically adjusted depending on current network latency.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram showing a simple network model with filled buffer;

FIG. 2 is a diagram showing a simple network model with empty buffer; and

FIG. 3 is flowchart showing an overview of transferring timestamp messages and other data between the client and server.

DESCRIPTION OF THE INVENTION

Definitions

The transitional term “comprising” is synonymous with “including,” “containing,” or “characterized by,” is inclusive or open-ended and does not exclude additional, unrecited elements or method steps.

The transitional phrase “consisting of” excludes any element, step, or ingredient not specified in the claim, but does not exclude additional components or steps that are unrelated to the invention such as impurities ordinarily associated with a composition.

The transitional phrase “consisting essentially of” limits the scope of a claim to the specified materials or steps and those that do not materially affect the basic and novel characteristic(s) of the claimed invention.

The term “bandwidth” and “send bandwidth” refer to various bit-rate measures, representing the available or consumed data communication resources expressed in bits per second or multiples of it.

The term “adaptive bandwidth management” means methods that continuously adjust the amount of data that is sent into a network per time in order to avoid or reduce network congestion and transfer delay . . . .

The term “buffer” or “network buffer” refers to a temporary storage area acting as a holding area, enabling the computer or network to manipulate data before transferring it to a device.

The term “client-server” refers to a computer system that selectively shares its resources; a client is a computer or computer program that initiates contact with a server in order to make use of a resource. This sharing of computer resources allows multiple people to use a computer server at the same time. Because a computer does a limited amount of work at any moment, a time-sharing system must quickly prioritize its tasks to accommodate the clients. Clients and servers exchange messages in a request-response messaging pattern: The client sends a request, and the server returns a response.

The term “application layer” or “application-level protocol” refers to the communications between computers. To communicate, the computers must have a common language, and they must follow rules so that both the client and the server know what to expect. The language and rules of communication are defined in a communications protocol. All client-server protocols operate in the application layer.

The term “lossy compression” refers to a data encoding method that compresses data by discarding or losing some of it. The procedure aims to minimize the amount of data that needs to be held, handled, and/or transmitted by a computer.

The term “network latency” can be measured either ‘one-way’ as the time taken for the source to send a packet to a destination or ‘round-trip’ from the one-way latency from source to destination plus the one-way latency from the destination back to the source.

The term “pseudo code” is an informal high-level description of the operating principle of a computer program or other algorithm.

The term “timestamp message” refers to a message that contains an indication of a point in time on either the server or the client, or the difference between two such points in time. Timestamp messages may be exchanged between client and server in both directions.

The term “Transmission Control Protocol” or TCP includes using a “congestion window” to determine how many packets can be sent at one time. The larger the congestion window size, the higher the throughput. The TCP “slow start” and “congestion avoidance” algorithms determine the size of the congestion window. The maximum congestion window is related to the amount of buffer space that the kernel allocates for each socket.

In the following description, various aspects of the present invention will be described. However, it will be apparent to those skilled in the art that the present invention may be practiced with only some or all aspects of the present invention. For purposes of explanation, specific numbers, materials, and configurations are set forth in order to provide a thorough understanding of the present invention. However, it will be apparent to one skilled in the art that the present invention may be practiced without the specific details. In other instances, well-known features are omitted or simplified in order not to obscure the present invention.

Parts of the description will be presented in data processing terms, such as data, selection, retrieval, generation, and so forth, consistent with the manner commonly employed by those skilled in the art to convey the substance of their work to others skilled in the art. As is well understood by those skilled in the art, these quantities (data, selection, retrieval, generation) take the form of electrical, magnetic, or optical signals capable of being stored, transferred, combined, and otherwise manipulated through electrical, optical, and/or biological components of a processor and its subsystems.

Various operations will be described as multiple discrete steps in turn, in a manner that is most helpful in understanding the present invention; however, the order of description should not be construed as to imply that these operations are necessarily order dependent.

Various embodiments will be illustrated in terms of exemplary classes and/or objects in an object-oriented programming paradigm. It will be apparent to one skilled in the art that the present invention can be practiced using any number of different classes/objects, not merely those included here for illustrative purposes. Furthermore, it will also be apparent that the present invention is not limited to any particular software programming language or programming paradigm.

The invention is illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to ‘an’ or ‘one’ embodiment in this disclosure are not necessarily to the same embodiment, and such references mean at least one.

A render server program is described in U.S. application Ser. No. 13/831,967, entitled “Multi-User Mult-GPU Render Server Apparatus and Methods”, inventors M. Westerhoff etr al, which was filed Mar. 15, 2013, is herein expressly incorporated by reference in its entirety. A rule based render server program is described in U.S. application Ser. No. 13/831,975, entitled “Method and System for Rule-Based Display of Sets of Images”, inventors M. Westerhoff etr al, which was filed Mar. 15, 2013, is herein expressly incorporated by reference in its entirety.

In one embodiment of the present invention, a client-server based medical image viewing system uses the TCP protocol for data transfer, but at the same time avoids network congestion and thus achieves both high data throughput and low latency. The system is non-intrusive in that it does not change the TCP implementation and does not make use of special network drivers. Instead, the transport layer is considered as a black box and only the actual performance of the network is monitored. Based on the monitoring results different actions are taken by the application itself in order to cope with the current network quality.

Water Pipe Model

The following analogy helps to illustrate an embodiment of the present invention. Assume that a network behaves like a system of water pipes of different cross-sections. Somewhere inside the system there are “water barrels” or network buffers that can fill up as shown in FIG. 1. Initially a large amount of “water” or data can be pumped into the network. “Inflow” or send bandwidth is high 120, but “outflow” or “read bandwidth” on the client side might be much smaller 130. In effect the “water barrels” or network buffers 100 fill up 110. It then takes a long time until a new “drop of water” or data packet can pass into the network. Latency has increased and the server is not able to respond quickly to user input. The result is that the “pipes” or connections are congested.

In order to keep the server responsive it is important to prevent the network buffers 100 from filling up 110 as depicted in FIG. 2. In an embodiment of the invention, the server can only send as much data into the network 120 as the thinnest pipe or weakest connection can convey 130. Note, that the overall throughput or bandwidth is not decreased if send bandwidth is limited.

Detecting Latency Increase

In an embodiment of the present invention, the system uses its own message-based protocol that is transported over a TCP connection. In this embodiment, all benefits of TCP are retained for ease of use and reliability. Small timestamp messages are sent from the server to the client and back from the client to the server. FIG. 3 is a flowchart depicting an overview of the message-based protocol and timestamp messages. These timestamp messages allow an estimate of the current network bandwidth to determine when network latency will increase due to congestion or decrease due to de-congestion. In FIG. 3 the server is on the left 300 and the client is on the right 350. The server has a transfer queue 360 that holds multiple data packets 310 queued for transfer. In addition, the client may request 340 extra packets 330 from the server. Data packets requested by the client have precedence 305 over packets originating from the transfer queue. Inserted into the data stream are timestamp messages 320 which are first sent from the server to the client and then returned by the client 355.

A timestamp message that the server sends to the client only contains the time ‘t’ in milliseconds since the server was started. In addition, the server stores the timestamp message in a First In, First Out (FIFO) queue. Together with the message the server also stores the total number of bytes ‘c’ that were sent to the client up to that point in time, as well as the current send bandwidth b_(send) in bytes/sec.

In an embodiment of the present invention, every timestamp message that arrives at the client is immediately sent back to the server. In an embodiment of the present invention, the order of messages is preserved. In an embodiment of the present invention, messages that are sent back to the server contain the difference ‘d’ between the client time (measured in milliseconds since client was started) and the server time ‘t’ that was contained in the incoming timestamp message. In an embodiment of the present invention, it is not required that clocks on server and client are synchronized, i.e., that both clocks were started at the same time.

In an embodiment of the present invention, the smallest value d_(min) that occurs in any of the timestamp messages that arrive back at the server defines a baseline for detecting increased latency. Without synchronized clocks it is difficult if not impossible to determine how long it really takes for a message to pass from the server to the client. However, it is possible to determine how much more travel time was needed for an arbitrary message compared to the fastest message. This increase of travel time or delay is given by e=d−d_(min). If ‘e’ increases significantly it is apparent that network buffers are filling up and that send bandwidth must be reduced.

Estimating Bandwidth

In an embodiment of the present invention, a key requirement for the system to be able to choose a reasonable send bandwidth and to adjust other application settings to network quality is a reliable estimate of sustained transfer bandwidth. An estimate is computed as follows:

If a timestamp message arrives back at the server, it is taken out of the FIFO queue. The time that was spent on the client between receiving the last two timestamp messages is given by: T=t _(i) −t _(i−1) +d _(i) −d _(i−1)

The amount of data C that was read in that time is given by the number of bytes that were sent between the last two timestamp messages: C=c _(i) −c _(i−1)

From these quantities the read bandwidth at the client is determined as: b _(read) =C/T.

In an embodiment of the present invention, if send bandwidth b_(send) is significantly larger than read bandwidth b_(read) (e.g. by more than 30%) we assume that the network is saturated and that b_(read) is a good estimate of transfer bandwidth. In an embodiment of the present invention, a running average is computed of multiple (e.g. 10) such b_(read) samples in order to obtain a best estimate b_(est) of transfer bandwidth. In an unexpected result, in order to quickly get reliable results, especially shortly after the client was started and the network is not yet saturated, it turned out to be beneficial to also include b_(read) samples into the running average if they are significantly larger than the current best estimate (e.g. by more than 40%). Further, in an embodiment of the present invention, outliers can be discarded by clamping b_(read) so that it does not exceed twice the current best estimate b_(est).

Limiting Send Bandwidth

In an embodiment of the present invention, a good estimate b_(est) of sustained transfer bandwidth allows the transfer to be slowed in case latency increases noticeably. In an embodiment of the present invention, send bandwidth is limited if the delay ‘e’ exceeds a certain threshold e_(max). In an embodiment of the present invention, send bandwidth is limited when e is greater than approximately 40 msec. In an alternative embodiment of the present invention, send bandwidth is limited when e is greater than approximately 50 msec. When calculating ‘e’ approximately refers to plus or minus twenty percent. In an embodiment of the present invention, a bandwidth limit b_(limit) of approximately sixty (60) percent of b_(est) is enforced when ‘e’ exceeds e_(max). In an alternative embodiment of the present invention, a bandwidth limit b_(limit) of approximately seventy (70) percent of b_(est) is enforced when ‘e’ exceeds e_(max). When calculating ‘b’ approximately refers to plus or minus twenty percent. In an embodiment of the present invention, if delay ‘e’ later drops below e_(max), the bandwidth limit is gradually lifted again by incrementing the current limit by a value that is increased if extra latency is reduced.

In various embodiments of the present invention, extra safeguards can be incorporated into the scheme in order to make it more robust against measurement errors and noise. In an embodiment of the present invention, bandwidth is not reduced if there are less than 10 KB of data in the line. In an embodiment of the present invention, the number of bytes in the line can be estimated by c−c_(i), where c is the current total number of bytes that were sent to the client up to that point in time and c_(i) is the total number of bytes that were sent to the client at the time the current timestamp message was sent. In an embodiment of the present invention, if a bandwidth limit is already active it is never reduced by more than 50%.

The resulting feedback scheme leads to a transmission rate on the server side that constantly oscillates around the estimated sustained transfer bandwidth b_(est). Usually oscillation frequency is higher if the total latency between server and client is lower. This is because timestamp messages return earlier at the server, and thus the server can adjust transmission rate more quickly. On higher latency connections oscillation frequency is lower, and amplitude of latency oscillation is greater. In various embodiments of the present invention, the overall behavior of the feedback scheme can be tuned by varying the different parameters. In practice, the values stated above turned out to work very well for different kinds of networks ranging from metropolitan area networks, domestic connections, and intercontinental lines.

Feedback scheme pseudo code:

  if (e > e_(max)) then  if (number of bytes in line > threshold)  then   b_(limit) := max(Factor₁ * b_(est), Factor₂ * b_(limit))  end else if (b_(read) > Factor₃ * b_(limit)) then  d := Factor₄ * b_(limit) * (e_(max) − e) /e_(max)  b_(limit) := b_(limit) + d end Adjusting Compression Settings and Buffering

In an embodiment of the present invention, the server always stays responsive by limiting send bandwidth. In an embodiment of the present invention, if a large number of data files (e.g., images) are queued for transfer on the server, when the client requests a large data file (e.g., a new image) or some other information be delivered instantly, then this new data can be sent without significant extra delay as seen in FIG. 3.

In an embodiment of the present invention, a good estimate of transfer bandwidth also allows the application to dynamically adjust other settings to current network quality, like image compression settings. If network bandwidth is poor, the application can react to that occurrence. For a single-stream application, such as streaming a single video or audio channel, the compression ratio can simply be adjusted such that the resulting bandwidth is slightly under the available bandwidth, which combined with buffering yields the desired result.

In another embodiment of the present invention, in an interactive visualization application, adjustable lossy compression can be applied in a similar manner in order to achieve smooth interaction. Image quality might be degraded, but images can still be displayed very quickly. Higher quality versions of the images can be resent later and the view can be refined. It is not obvious though, how buffering can be applied, because the interaction is not known ahead of time.

An example for such an application is a client server system to display medical image studies. Medical image studies can consist of multiple images that can be organized in multiple series. It is desirable to be able to view these images in a multi-viewport layout on the client computer. As the user looks at a series of images, the user will interact with the images, e.g., scrolling, rotating panning or zooming. It is not known in advance, in which direction a user will scroll, or if multiple image series exist, which of these the user will look at first. The same is true for any other interaction with the scene, such as rotation of a 3D volume rendering.

Another embodiment of the present invention monitors the current user interaction and allows the application to anticipate the next views to be streamed. These views are then streamed to the client and buffered, so that they can be displayed without delay.

For example if a user looks at and interacts with a viewport displaying one image series (“Current Series”), images from that series will more likely be displayed next than images from other series. Thus these images will be streamed to a buffer on the client side first. The order is determined by the distance of images to the currently displayed image in the sorting order of the series: The closest image will be streamed first. The same concept applies to other types of displays and other types of interaction. For example if a 3D volume rendered view of a data set is shown and the user currently rotates the model about e.g. the X-axis, then from the current view, the next views can be anticipated and pre-streamed and buffered locally.

In another embodiment of the present invention, if the user stops rotating, then some views that may have been buffered already may need to be discarded from the buffer, but that is typically a small number compared to the whole sequence. In order to use these techniques in interactive applications, a high-degree of responsiveness is required to avoid lags when the user decides to change e.g. scrolling or rotation direction or starts interacting with another view port. This responsiveness is achieved by the adaptive bandwidth management as described above.

TABLE 1 Symbols and Meaning Symbol Description T Server time in msec (since session was started) C Total number of bytes that were sent from server to client b_(send) Send bandwidth at server b_(read) Read bandwidth at client b_(est) Estimate of sustained transfer bandwidth b_(limit) Bandwidth limit on server side (send) D Difference between client and server time when TS arrives at client E Extra travel time for messages sent from server to client (delay)

While the present invention has been described in some detail for purposes of clarity and understanding, one skilled in the art will appreciate that various changes in form and detail can be made without departing from the true scope of the invention. All figures, tables, and appendices, as well as patents, applications, and publications, referred to above, are hereby incorporated by reference.

Aspects of the Invention

In an embodiment of the invention, a method of identifying network latency comprising the steps of sending a request for image data from a client computer, including inserting a first timestamp message into the request for image data at an application level, transferring the request and the first timestamp message to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the first timestamp message at the server, calculating a first time for the first timestamp message to be sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, returning the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times and using the estimate of current network bandwidth to determine network latency.

In an embodiment of the invention, a method of identifying network latency comprising the steps of sending a request for image data from a client computer, including inserting a first timestamp message into the request for image data at an application level, transferring the request and the first timestamp message to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the first timestamp message at the server, calculating a first time for the first timestamp message to be sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, returning the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times and using the estimate of current network bandwidth to determine network latency, where the estimate of current network bandwidth is calculated from a difference between the first time and the second time or the second time and the subsequent times.

In an embodiment of the invention, a method of identifying network latency comprising the steps of sending a request for image data from a client computer, including inserting a first timestamp message into the request for image data at an application level, transferring the request and the first timestamp message to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the first timestamp message at the server, calculating a first time for the first timestamp message to be sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, returning the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times and using the estimate of current network bandwidth to determine network latency, where the estimate of current network bandwidth is compared with a minimum network bandwidth.

In an embodiment of the invention, a method of identifying network latency comprising the steps of sending a request for image data from a client computer, including inserting a first timestamp message into the request for image data at an application level, transferring the request and the first timestamp message to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the first timestamp message at the server, calculating a first time for the first timestamp message to be sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, returning the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times and using the estimate of current network bandwidth to determine network latency, where the estimate of current network bandwidth is compared with a minimum network bandwidth, further comprising refining the estimate of current network bandwidth based on a comparison.

In an embodiment of the invention, a method of identifying network latency comprising the steps of sending a request for image data from a client computer, including inserting a first timestamp message into the request for image data at an application level, transferring the request and the first timestamp message to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the first timestamp message at the server, calculating a first time for the first timestamp message to be sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, returning the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times and using the estimate of current network bandwidth to determine network latency, where information related to one or more of the first timestamp message and the one or more second timestamp messages is not included in the first timestamp message, where the information is stored in a first in first out queue, so that the information can be evaluated when the first timestamp message arrives back at the server.

In an embodiment of the invention, a method of identifying network latency comprising the steps of sending a request for image data from a client computer, including inserting a first timestamp message into the request for image data at an application level, transferring the request and the first timestamp message to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the first timestamp message at the server, calculating a first time for the first timestamp message to be sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, returning the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times and using the estimate of current network bandwidth to determine network latency, where one or more of the first timestamp message and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1), where d_(i) and d_(i−1) are client time and t_(i)−t_(i−1) are server time in a last two incoming timestamp messages.

In an embodiment of the invention, a method of identifying network latency comprising the steps of sending a request for image data from a client computer, including inserting a first timestamp message into the request for image data at an application level, transferring the request and the first timestamp message to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the first timestamp message at the server, calculating a first time for the first timestamp message to be sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, returning the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times and using the estimate of current network bandwidth to determine network latency, where one or more of the first timestamp message and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1), where d_(i) and d_(i−1) are client time and t_(i)−t_(i−1) are server time in a last two incoming timestamp messages, further comprising determining a running average, where samples of read bandwidth are combined into the running average.

In an embodiment of the invention, a method of identifying network latency comprising the steps of sending a request for image data from a client computer, including inserting a first timestamp message into the request for image data at an application level, transferring the request and the first timestamp message to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the first timestamp message at the server, calculating a first time for the first timestamp message to be sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, returning the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times and using the estimate of current network bandwidth to determine network latency, where one or more of the first timestamp message and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1), where d_(i) and d_(i−1) are client time and t_(i)−t_(i−1) are server time in a last two incoming timestamp messages, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where samples are excluded from the running average when send bandwidth is less than between a lower limit of approximately 20 percent of read bandwidth and an upper limit of approximately 40 percent of read bandwidth.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where the estimate of current network bandwidth is calculated from a difference between the first time and the second time or the second time and the subsequent times.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where the estimate of current network bandwidth is compared with a minimum network bandwidth.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where the estimate of current network bandwidth is compared with a minimum network bandwidth, further comprising refining the estimate of current network bandwidth based on a comparison between times.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where samples are excluded from the running average when send bandwidth is less than between a lower limit of approximately 130 percent of the read bandwidth measured at a time a timestamp message arrived at the client computer and an upper limit of approximately 140 percent of the estimate of current network bandwidth.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where a bandwidth limit is applied on the server in order to avoid network delays, where the bandwidth limit is computed using a feedback scheme.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where a bandwidth limit is applied on the server in order to avoid network delays, where the bandwidth limit is computed using a feedback scheme, where the feedback scheme uses a pseudo code.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where a bandwidth limit is applied on the server in order to avoid network delays, where the bandwidth limit is computed using a feedback scheme, where the feedback scheme uses a pseudo code, where the pseudo code includes an expression

  if (e > emax) then  if (number of bytes in line > threshold)  then   blimit := max(Factor1 * best, Factor2 * blimit)  end else if (bread > Factor3 * blimit) then  d := Factor4 * blimit * (emax − e)/emax  blimit := blimit + d end.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where a bandwidth limit is applied on the server in order to avoid network delays, where the bandwidth limit is computed using a feedback scheme, where the bandwidth limit is used to compute a lossy compression rate, where the lossy compression rate is calculated in order to achieve a desired interactive speed, where the feedback scheme uses a pseudo code, where the lossy compression rate is used to stream compressed images with a compression ratio, where the pseudo code includes an expression

  if (e > emax) then  if (number of bytes in line > threshold)  then   blimit := max(Factor1 * best, Factor2 * blimit)  end else if (bread > Factor3 * blimit) then  d := Factor4 * blimit * (emax − e)/emax  blimit := blimit + d end.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where a bandwidth limit is applied on the server in order to avoid network delays, where the bandwidth limit is computed using a feedback scheme, where the bandwidth limit is used to compute a lossy compression rate, where the lossy compression rate is calculated in order to achieve a desired interactive speed, where the feedback scheme uses a pseudo code, where the lossy compression rate is used to stream compressed images with a compression ratio, where images are streamed to the client computer using a buffering system, where the pseudo code includes an expression

  if (e > emax) then  if (number of bytes in line > threshold)  then   blimit := max(Factor1 * best, Factor2 * blimit)  end else if (bread > Factor3 * blimit) then  d := Factor4 * blimit * (emax − e)/emax  blimit := blimit + d end.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where a bandwidth limit is applied on the server in order to avoid network delays, where the bandwidth limit is computed using a feedback scheme, where the bandwidth limit is used to compute a lossy compression rate, where the lossy compression rate is calculated in order to achieve a desired interactive speed, where the feedback scheme uses a pseudo code, where the lossy compression rate is used to stream compressed images with a compression ratio, where images are streamed to the client computer using a buffering system, where the buffering system is based on monitoring user interaction and anticipating a next image that will be requested by the client computer, where the pseudo code includes an expression

  if (e > emax) then  if (number of bytes in line > threshold)  then   blimit := max(Factor1 * best, Factor2 * blimit)  end else if (bread > Factor3 * blimit) then  d := Factor4 * blimit * (emax − e)/emax  blimit := blimit + d end.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where a bandwidth limit is applied on the server in order to avoid network delays, where the bandwidth limit is computed using a feedback scheme, where the bandwidth limit is used to compute a lossy compression rate, where the lossy compression rate is calculated in order to achieve a desired interactive speed, where the feedback scheme uses a pseudo code, where the lossy compression rate is used to stream compressed images with a compression ratio, where images are streamed to the client computer using a buffering system, where the buffering system is based on monitoring user interaction and anticipating a next image that will be requested by the client computer, where the bandwidth limit is used to compute the lossy compression rate, where the lossy compression rate is used to calculate a compression ratio, where one or more compressed images are streamed with the compression ratio, where a target compression quality is defined by a user, where a first image is streamed with a first compression quality, where the first compression quality minimizes network latency during interaction based on bandwidth monitoring and where the first image is streamed with a second compression quality when the user stops interacting, where the second compression quality is greater than the first compression quality if the first compression quality is lower than a target compression quality, where the pseudo code includes an expression

  if (e > emax) then  if (number of bytes in line > threshold)  then   blimit := max(Factor1 * best, Factor2 * blimit)  end else if (bread > Factor3 * blimit) then  d := Factor4 * blimit * (emax − e)/emax  blimit := blimit + d end.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where a bandwidth limit is applied on the server in order to avoid network delays, where the bandwidth limit is computed using a feedback scheme, where the bandwidth limit is used to compute a lossy compression rate, where the lossy compression rate is calculated in order to achieve a desired interactive speed, where the feedback scheme uses a pseudo code, where the lossy compression rate is used to stream compressed images with a compression ratio, where images are streamed to the client computer using a buffering system, where the buffering system is based on monitoring user interaction and anticipating a next image that will be requested by the client computer, where the bandwidth limit is used to compute the lossy compression rate, where the lossy compression rate is used to calculate a compression ratio, where one or more compressed images are streamed with the compression ratio, where the pseudo code includes an expression

  if (e > emax) then  if (number of bytes in line > threshold)  then   blimit := max(Factor1 * best, Factor2 * blimit)  end else if (bread > Factor3 * blimit) then  d := Factor4 * blimit * (emax − e)/emax  blimit := blimit + d end.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where a bandwidth limit is applied on the server in order to avoid network delays, where the bandwidth limit is computed using a feedback scheme, where the bandwidth limit is used to compute a lossy compression rate, where the lossy compression rate is calculated in order to achieve a desired interactive speed, where the feedback scheme uses a pseudo code, where the lossy compression rate is used to stream compressed images with a compression ratio, where images are streamed to the client computer using a buffering system, where the buffering system is based on monitoring user interaction and anticipating a next image that will be requested by the client computer, where the bandwidth limit is used to compute the lossy compression rate, where the pseudo code includes an expression

  if (e > emax) then  if (number of bytes in line > threshold)  then   blimit := max(Factor1 * best, Factor2 * blimit)  end else if (bread > Factor3 * blimit) then  d := Factor4 * blimit * (emax − e)/emax  blimit := blimit + d end.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where a bandwidth limit is applied on the server in order to avoid network delays, where the bandwidth limit is computed using a feedback scheme, where the bandwidth limit is used to compute a lossy compression rate, where the lossy compression rate is calculated in order to achieve a desired interactive speed, where the feedback scheme uses a pseudo code, where the pseudo code includes an expression

  if (e > emax) then  if (number of bytes in line > threshold)  then   blimit := max(Factor1 * best, Factor2 * blimit)  end else if (bread > Factor3 * blimit) then  d := Factor4 * blimit * (emax − e)/emax  blimit := blimit + d end.

In an alternative embodiment of the present invention, a method of minimizing network latency comprises the steps of sending a request for image data from a client computer, including inserting one or more first timestamp messages into the request for image data at an application level, transferring the request and the one or more first timestamp messages to a server using a standard Transmission Communications Protocol (TCP) connection, receiving the one or more first timestamp messages at the server, calculating a first time for the one or more first timestamp messages sent from the client computer to the server, sending one or more second timestamp messages from the server to the client computer, sending the one or more second timestamp messages from the client computer to the server, receiving the one or more second timestamp messages from the client computer at the server, calculating a second time and subsequent times taken for the one or more second timestamp messages to be sent from the client computer to the server, calculating an estimate of current network bandwidth based on one or more of the first time, the second time and the subsequent times, determining situations where network latency will otherwise increase due to congestion and using the estimate of current network bandwidth to one or both reduce and modify client computer requests to minimize network latency, where information related to one or both the one or more first timestamp messages and the one or more second timestamp messages is not included in the one or more first timestamp messages and the one or more second timestamp messages, where the information is stored in a first in first out queue, so that the information can be evaluated when the one or more first timestamp messages and the one or more second timestamp messages arrives back at the server, where one or both the one or more first timestamp messages and the one or more second timestamp messages are used to compute read bandwidth (C/T), where C=c_(i)−c_(i−1), where C is an amount of data C that was read in a time between a last two timestamp messages (c_(i)−c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1) is the time elapsed on the client computer between the last two timestamp messages, where t_(i)−t_(i−1) is the time elapsed on the server between the last two timestamp messages, and d_(i) and d_(i−1) are differences between client computer time and server time when the messages arrived at the client computer, further comprising determining a running average, where samples of read bandwidth are combined into the running average, where a bandwidth limit is applied on the server in order to avoid network delays, where the bandwidth limit is computed using a feedback scheme, where the bandwidth limit is used to compute a lossy compression rate, where the feedback scheme uses a pseudo code, where the pseudo code includes an expression

  if (e > emax) then  if (number of bytes in line > threshold)  then   blimit := max(Factor1 * best, Factor2 * blimit)  end else if (bread > Factor3 * blimit) then  d := Factor4 * blimit * (emax − e)/emax  blimit := blimit + d end. 

What is claimed:
 1. A method comprising: (a) determining a congestion window comprising: inserting a first timestamp message into an application level request associated with a first image from a client computer; transferring the application level request and the first timestamp message to a server; receiving the first timestamp message at the server, where a first client time is calculated based at least on a time the first timestamp message is received, and calculating a send bandwidth at the server (b_(send)) in bytes/msec, where the first client time is used to compute b_(send); sending one or more second timestamp messages from the server to the client computer; receiving the one or more second timestamp messages at the client computer, where a second time (t_(i)) is calculated based at least on one or more times the one or more second timestamp messages are received; calculating a read bandwidth (b_(read)) in bytes/msec, where b_(read)=C/T, where C (total number of bytes that were sent from server to client)=c_(i)−c_(i−1), is an amount of data that was sent in a time between a last two second timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1), where T (server time in msec) is equal to the time elapsed on the server between the last two second timestamp messages (t_(i)−t_(i−1)), where d_(i) and d_(i−1) are client times and t_(i)−t_(i−1) are server times in the last two incoming timestamp messages of the one or more second timestamp messages; and assigning a congestion window equal to b_(read) when b_(send) is greater than or equal to 1.3×b_(read); and (b) using the congestion window to achieve one or both a high data throughput and a low latency.
 2. The method of claim 1, further comprising determining a running average of b_(read) based on a comparison between two or more second times.
 3. The method of claim 1, further comprising using a bandwidth limit applied on the server to reduce b_(send).
 4. The method of claim 3, where the bandwidth limit is computed using a feedback scheme.
 5. The method of claim 4, where the feedback scheme uses a pseudo code.
 6. The method of claim 5, where the pseudo code includes an expression   if (e > emax) then  if (number of bytes in line > threshold)  then   b_(limit) := max(Factor1 * b_(est), Factor2 * b_(limit))  end else if (b_(read) > Factor3 * b_(limit)) then  d := Factor4 * b_(limit) * (emax − e)/emax  b_(limit) := b_(limit) + d end.


7. The method of claim 1, further comprising assigning a congestion window equal to b_(read) when b_(send) is greater than or equal to 1.4×b_(read).
 8. The method of claim 1, further comprising discarding one or more contributions of outliers to b_(read) such that b_(read) does not exceed 2×b_(est).
 9. A method comprising: (a) determining a bandwidth limit comprising: inserting a first timestamp message into an application level request associated with a first image from a client computer; transferring the application level request and the first timestamp message to a server; receiving the first timestamp message at the server, where a first client time is calculated based at least on a time the first timestamp message is received, and calculating a send bandwidth at the server (b_(send)) in bytes/msec, where the first client time is used to compute b_(send); sending one or more second timestamp messages from the server to the client computer; receiving the one or more second timestamp messages at the client computer, where one or more second times (t_(i)) are calculated based at least on the time the one or more second timestamp messages are received; calculating a read bandwidth (b_(read)) in bytes/msec, where b_(read)=C/T, where C (total number of bytes that were sent from server to client)=c_(i)−c_(i−1), is an amount of data that was sent in a time between a last two second timestamp messages (c_(i), c_(i−1)), and T=t_(i)−t_(i−1)+d_(i)−d_(i−1), where T (server time in msec) is equal to the time elapsed on the server between the last two second timestamp messages (t_(i)−t_(i−1)), where d_(i) and d_(i−1) are client times and t_(i)−t_(i−1) are server times in the last two incoming timestamp messages of the one or more second timestamp messages; and assigning a bandwidth limit equal to sixty (60) percent of b_(read) when delay (e) exceeds 40 msec; and (b) using the bandwidth limit to limit b_(send) to achieve one or both a high data throughput and a low latency.
 10. The method of claim 9, further comprising determining a running average of b_(read) based on a comparison between two or more second times.
 11. The method of claim 9, where the bandwidth limit is computed using a feedback scheme.
 12. The method of claim 10, where the feedback scheme uses a pseudo code.
 13. The method of claim 12, further comprising where the pseudo code includes an expression   if (e > emax) then  if (number of bytes in line > threshold)  then   b_(limit) := max(Factor1 * b_(est), Factor2 * b_(limit))  end else if (b_(read) > Factor3 * b_(limit)) then  d := Factor4 * b_(limit) * (emax − e)/emax  b_(limit) := b_(limit) + d end.


14. The method of claim 9, further comprising increasing the bandwidth limit when e is less than 40 msec.
 15. The method of claim 9, further comprising increasing the bandwidth limit when the amount of data (c−c_(i)) is less than 10 KB.
 16. The method of claim 9, further comprising where the bandwidth limit is reduced by less than fifty (50) percent.
 17. A method comprising: (a) determining a bandwidth limit comprising: inserting a first timestamp message into an application level request associated with a first image from a client computer; transferring the application level request and the first timestamp message to a server; receiving the first timestamp message at the server, where a first client time is calculated based at least on a time the first timestamp message is received, and calculating a send bandwidth at the server (b_(send)) in bytes/msec, where the first client time is used to compute b_(send); sending one or more second timestamp messages from the server to the client computer; receiving the one or more second timestamp messages at the client computer, where one or more second times (t_(i)) are calculated based at least on the time the one or more second timestamp messages are received; calculating a read bandwidth (b_(read)) in bytes/msec, where b_(read)=C/T, where C (total number of bytes that were sent from server to client)=c_(i)−c_(i−1), is an amount of data that was sent in a time between a last two second timestamp messages (c_(i), c_(i−1)), and T (server time in msec)=t_(i)−t_(i−1)+d_(i)−d_(i−1), where d_(i) and d_(i−1) are client times and t_(i)−t_(i−1) are server times in the last two incoming timestamp messages of the one or more second timestamp messages; and assigning a bandwidth limit equal to sixty (60) percent of b_(read) when delay (e) exceeds   emax then  if (number of bytes in line > threshold)  then   b_(limit) := max(Factor1 * b_(est), Factor2 * b_(limit))  end else if (b_(read) > Factor3 * b_(limit)) then  d := Factor4 * b_(limit) * (emax − e)/emax  b_(limit) := b_(limit) + d end;

and (b) limiting b_(send) using the bandwidth limit to achieve one or both a high data throughput and a low latency.
 18. The method of claim 17, further comprising increasing the bandwidth limit when e is less than emax.
 19. The method of claim 17, further comprising increasing the bandwidth limit when the amount of data (c−c_(i)) is less than 10 KB.
 20. The method of claim 17, further comprising where the bandwidth limit is reduced by less than fifty (50) percent. 